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This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

This is an express request to begin national examination procedures (35 U.S.C 371(f))- The submission must include itens (5), (6), 
(9) and (24) indicated below. 

The US has been elected by the expiration of 19 months from the priority date (Article 31). 
A copy of the International Application as filed (35 U.S.C. 371 (c) (2)) 

a. (3 is attached hereto (required only if not communicated by the International Bureau). 

b. □ has been communicated by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 
An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. □ is attached hereto. 

b. □ has been previously submitted under 35 U.S.C. 154(d)(4). 

Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371 (c)(3)) 

a. □ are attached hereto (required only if not communicated by the International Bureau). 
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c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 
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24. The followine fees are submitted.. 






CALCULATIONS PTO US! ONLY 


BASIC NATIONAL FEE ( 37 CFR 1.492 (a) (1) - (5)) 

□ Neither international picliminaiy examination fee (37 CFR 1 482) nor 
international seaich fee (37 CFR 1 445(a)(2)) paid to USPTO 

and International Search Report not prepared by the EPO or JPO . ... S1040.UU 

£3 International preliminary examination fee (37 CFR 1 4S2) not paid to 

USPTO but International Seaich Rcpoit prcpaicd by the EPO or JPO . bSJu.UU 

□ International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1 445(a)(2)) paid to USP I O 2> /4U.uu 

□ International preliminary examination fee (37 CFR 1 482) paid to USPTO 

but all claims did not satisfy provisions of PCT Article 33(1 )-(4) S7 10.00 

□ International preliminary examination fee (37 CFR 1.482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(1 )-(4) SI 00.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 




S890.00 




Surcharge of $130.00 for furnishing the oath or declaration later than □ 20 30 
months from the earliest claimed priority date (37 CFR 1 492 (e)). 


$130.00 




CLAIMS 


NUMBER FILED 


NUMBER EXTRA 


RATE 




Total claims 


35 -20 = 


15 


x S18.00 


S270.00 




Indeoendent claims 


2 - 3 = 


0 


x $84.00 


$0.00 




Mnlfinlp Dependent Claims (check if anolicable) 




S280.00 




TOTAL OF ABOVE CALCULATIONS = 


$1,570.00 




□ Applicant claims small entity status. See 37 CFR 1 27). The fees indicated above are 
reduced by 1/2 


$0.00 




SUBTOTAL = 


$1,570.00 




Processing fee of $130.00 for furnishing the English translation later than □ 20 □ 30 
months from the earliest claimed priority date (37 CFR 1 .492 (f)). + 


S0.00 




TOTAL NATIONAL FEE 


$1,570.00 




Fee for recording the enclosed assignment (37 CFR 1 2 1 (h)) The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3 31) (check if applicable). 


□ 


S0.00 
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A check in the amount of 



to cover the above fees is enclosed. 

Please charge my Deposit Account No 19-0733 in the amount of $1,570.00 



to cover the above fees. 



A duplicate copy of this sheet is enclosed. 

The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment 
to Deposit Account No. 19-0733 A duplicate copy of this sheet is enclosed 

Fees are to be charged to a credit card WARNING: Information on this form may become public. Credit card 
information should not be included on this form. Provide credit card information and authorization on PTO-203S 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 
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PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Group Art Unit: TBA 
Examiner: TBA 



Atty. Docket No. 000487.00012 



In re Application of 

Derek WOOLFSON et al. 

Serial No. 10/088,417 

Filed: March 18, 2002 

For: PROTEIN STRUCTURES AND PROTEIN FIBRES 

SUBMISSION OF SEQUENCE LISTING 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

In response to the Notice to File Missing Requirements mailed May 21, 2002, in the 
above-identified application, Applicants submit a computer readable form (CRF). The content of the 
CRF and the paper copy submitted herewith are believed to be the same and to add no new matter. It 
is believed that no fee is due. However, if such a fee is deemed necessary, please charge our Deposit 
Account No. 19-0733. 



Respectfully submitted, 



By: 



Dated: July 22, 2002 

Eleventh Floor 
1001 G Street, N.W. 
Washington, D.C. 20001-4597 
(202) 508-9100 



Sarah A. Kagan 
Registration No. 32,141 
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PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Group Art Unit: TB A 
Examiner: TBA 

Atty. Docket No. 000487.00012 



In re Application of 
Derek WOOLFSON et al. 
Serial No. 10/088,417 
Filed: March 18, 2002 
For: PROTEIN STRUCTURES AND PROTEIN FIBRES 

PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D. C. 20231 

Sir: 

Preliminarily to the examination of the above-identified application, kindly amend the application as 

follows: 

IN THE SPECIFICATION : 

Page 1, after the title, insert the following paragraph: 

This national phase Application of PCT/GBOO/03576 filed September 18, 2000 was published under 
PCT Article 21(12) in English and claims the priority of GB 9922013.9. 

REMARKS 

The amendment to the specification is made in accordance with 35 U.S.C. 119, 37 C.F.R. 1.78 and 
37 C.F.R. 1.55. No new matter has been entered. Entry is requested. 

Respectfully submitted, 



Sarah A. Kagan /\ 



Dated: July 18,2002 

Registration No. 32,141 

BANNER & WITCOFF, LTD. 
1001 G Street, N.W. 
Eleventh Floor 
Washington, D.C. 20001 
TEL: (202)508-9100 
FAX: (202)508-9299 
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PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of 
Derek WOOLFSON et al. 
Serial No. 10/088,417 
Filed: March 18, 2002 



Group Art Unit: TBA 



Examiner: TBA 



Atty. Docket No. 000487.00012 



For: PROTEIN STRUCTURES AND PROTEIN FIBRES 

SECOND PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D. C. 20231 

Sir: 

Preliminarily to the examination of the above-identified application, kindly amend the 
application as follows: 
IN THE CLAIMS: 

16. (Amended) A protein structure according to any preceding claim in which the first and 
second peptide monomer units have the sequence: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl; SEQ ID NO: 1) and 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2; SEQ ID NO: 2) respectively; or 
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c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA; SEQ ID NO: 3) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 4) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pIC; SEQ ID NO: 1) and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 4) respectively. 

17. (Amended) A peptide monomer unit for use in preparing a protein structure the peptide 
monomer unit having an amino acid sequence selected from: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl; SEQ ID NO: 1); 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2; SEQ ID NO: 2); 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA; SEQ ID NO: 3); 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 5); 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pIC; SEQ ID NO: 1) ; and 
d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 5). 

IN THE SPECIFICATION 

At page 4, paragraph 3, substitute the following paragraph: 

In a preferred protein structure, the first and second peptide monomer units have the following 
sequences: 

478651-1 
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a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC; SEQ ID NO: 1) and 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2D; SEQ ID NO: 2) respectively; or 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA; SEQ ID NO: 3) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 4) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC; SEQ ID NO: 1) and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEQ ID NO: 4) respectively. 

At page 9, table legend, substitute the following legend: 

*=Chemical capping = CH 3 CO at the N terminus and NH 2 at the C terminus. The sequences in 
the table are SEQ ID NO: 5, 6, 7, 8, 9, 10, 10, 11, and 12, respectively. 



At page 7, paragraph 5, substitute the following paragraph: 

Fig. 1 A and Fig. IB illustrate the design (Fig. 1A) and the sequences (Fig. IB; SEQ ID NOs: 15 
and 16) of self-assembling fibre (SAF) peptide monomers of the invention. 

At page 8, paragraph 2, substitute the following paragraph: 

Fig. 8 shows amino acid sequences (SEQ ID NOs: 17 and 18) designed to form blunt-ended 
heterodimers. 



At page 12, paragraph 1, substitute the following paragraph: 

In addition and as a control, the SAF-p I c sequence was permuted (N- and C-terminal halves 
were swapped) to produce peptide SAF-p3: 
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E IDALEYE NDALEQK IAALKQK IASLKQ (SEQ TD NO: 13.) 

This design should combine with SAF-p2D to form a blunt-ended structure, which should not 
form fibres. 

At page 12, paragraph 2, substitute the following paragraph: 

A model of the three-dimensional structure of the designed protein fibre resulting from the 
assembly of SAF-pl and S AF-p2 was made from the minimised structure of a model coiled-coil 
35-mer, (LAALAAA)s (SEQ ID NO: 14), which was generated using Crick's Equation and had 
an ideally packed interface (G. Offer and R. Sessions, J. Mol. Biol. 249, 967 (1995)). Copies of 
the 35-mer were superimposed with an overlap of one heptad repeat to extend the structural 
template, and the backbone was rejoined after removal of overlapping°segments. Residues in the 
two-stranded template were replaced with the sequences of the SAF peptides, staggered relative 
to each other by two heptad repeats according to the alignment in. Fig. IB. The structure was 
soaked in a 5 A layer of water and energy minimised until the average absolute derivative of 
coordinates with respect to energy fell below 0.01 kcal A-'. The structure was built and 
visualized using insight II 97.0 (Molecular Simulations Inc.), and was energy-minimized using 
Discover 2.9.8 (Molecular Simulations Inc.) with the consistent valence forcefield. In Fig 2(A) 
peptides SAF-p 1 and SAF-p2 (each coloured dark grey-to-light grey from the N-terminus) 
interact through core residues including asparagine pairs (coloured mid-grey) to form the two 
strands of a staggered, parallel, coiled-coil fibre. In Fig. 2(B), negatively charged glutamate side 
chains (coloured light grey) and positively charged lysine side chains (coloured black) form 
complementary charge interactions between the SAF peptides. 

Enter the attached sequence listing at the end of the specification. 
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REMARKS 



The amendments are made to refer to the sequence identifiers in the sequence listing. No new matter 
has been entered. Entry is requested. 



BANNER & WITCOFF, LTD. 
1001 G Street, N.W. 
Eleventh Floor 
Washington, D.C. 20001 
TEL: (202)508-9100 
FAX: (202)508-9299 



Respectfully submitted, 



Dated: July 22, 2002 




Sarah A. Kagan 
Registration No. 32,141 
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IN THE SPECIFICATION 

At page 4, paragraph 3, substitute the following paragraph: 

In a preferred protein structure, the first and second peptide monomer units have the following 
sequences: 

a) KIAALKQKIASLKQEID ALEYENDALEQ (S AF-p 1 C ; SEP ID NO: 1 ) and 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ ( SAF-p2D ; SEP ID NO: 2) respectively; 
or 

c) KIAALKQKIAALKQEID ALEYENDALEQ (SAF-plA ; SEP ID NO: 3 ) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C; SEP ID NO: 4 ) respectively; or 

e) KIAALKQKIASLKQEID ALEYENDALEQ (SAF-plC ; SEP ID NO: 1 ) and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C ; SEP ID NO: 4) respectively. 



At page 9, table legend, substitute the following legend: 

*=Chemical capping = CH 3 CO at the N terminus and NH 2 at the C terminus. The sequences in 
the table are SEP ID NO: 5, 6, 7, 8, 9, 10, 10, 11. and 12. respectively. 



At page 7, paragraph 5, substitute the following paragraph: 

Fig. 1 A and Fig. IB [illustrates] illustrate the design (Fig. 1A) and the sequences (Fig. IB: SEP 
ID NOs: 15 and 16) of self-assembling fibre (SAF) peptide monomers of the invention. 

At page 8, paragraph 2, substitute the following paragraph: 
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Fig. 8 shows amino acid sequences fSEO ID NOs: 17 and 18) designed to form blunt-ended 
heterodimers. 

At page 12, paragraph 1, substitute the following paragraph: 

In addition and as a control, the SAF-p I c sequence was permuted (N- and C-terminal halves 
were swapped) to produce peptide SAF-p3: 

E IDALEYE NDALEQK IAALKQK IASLKQ f SEP ID NO: 13.) 

This design should combine with SAF-p2D to form a blunt-ended structure, which should not 
form fibres. 

At page 12, paragraph 2, substitute the following paragraph: 

A model of the three-dimensional structure of the designed protein fibre resulting from the 
assembly of SAF-pl and SAF-p2 was made from the minimised structure of a model coiled-coil 
35-mer, (LAALAAA)s (SEP ID NO: 14) , which was generated using Crick's Equation and had 
an ideally packed interface (G. Offer and R. Sessions, J. Mol. Biol. 249, 967 (1995)). Copies of 
the 35-mer were superimposed with an overlap of one heptad repeat to extend the structural 
template, and the backbone was rejoined after removal of overlapping°segments. Residues in the 
two-stranded template were replaced with the sequences of the SAF peptides, staggered relative 
to each other by two heptad repeats according to the alignment in. Fig. IB. The structure was 
soaked in a 5 A layer of water and energy minimised until the average absolute derivative of 
coordinates with respect to energy fell below 0.01 kcal A-'. The structure was built and 
visualized using insight II 97.0 (Molecular Simulations Inc.), and was energy-minimized using 
Discover 2.9.8 (Molecular Simulations Inc.) with the consistent valence forcefield. In Fig 2(A) 
peptides SAF-p 1 and SAF-p2 (each coloured dark grey-to-light grey from the N-terminus) 
interact through core residues including asparagine pairs (coloured mid-grey) to form the two 
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strands of a staggered, parallel, coiled-coil fibre. In Fig. 2(B), negatively charged glutamate side 
chains (coloured light grey) and positively charged lysine side chains (coloured black) form 
complementary charge interactions between the SAF peptides. 

IN THE CLAIMS: 

16. (Amended) A protein structure according to any preceding claim in which the first and 
second peptide monomer units have the sequence: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl : SEP ID NO: 1) and 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ fSAF-p2 ; SEP ID NO: 2) respectively; or 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA ; SEP ID NP: 3 ) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C : SEP ID NP: 4) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pIC : SEP IDNP: 1) and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ rSAF-p2C : SEP ID NP: 4) respectively. 

17. (Amended) A peptide monomer unit for use in preparing a protein structure the peptide 
monomer unit having an amino acid sequence selected from: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl : SEP IDNP: 1) ; 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2 : SEP ID NP: 2 ): 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA : SEP ID NP: 3 1: 
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d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C : SEP ID NO: 5 ); 

e) KIAALKQKIASLKQEIDALEYENDALEQ rSAF-plC : SEP ID NO: 1) ; and 
d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C : SEP ID NO: 5) . 
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PROTEIN^ STRUCTURES AND PROTEIN FIBRES 



This invention relates to protein structures, to methods of producing those protein structures, 
and to protein fibres and other materials and assemblies produced using those protein 
structures. 

The process of molecular self-assembly is central to all biological systems and is assuming 
increasing importance and application in biotechnology (L. Q. Gu, et al (1999) Nature 398, 
686) and nanotechnology (K. E. Drexler, (1999) TIBTECH 17, 5). The characterization of 
natural biomolecular assemblies motivates and directs the development of model 
self-assembling systems and, in turn, these advance our understanding of biology. For 
proteins at least, the coiled coil is arguably the simplest self-assembling system. Coiled coils 
are protein-folding motifs that direct and cement a wide variety of protein-protein interactions 
(A. Lupas, (1996) Trends Biochem. Sci 21, 375). In structural terms, coiled coils are 
relatively straightforward: they are a-helical bundles with between 2 and 5 strands that can be 
arranged in parallel, antiparallel or mixed topologies. The basic sequence features that guide 
the formation of coiled coils from peptides are reasonably well understood (P. B. Harbury et 
al (1993) Science 262, 1401; D. N. Woolfson and T. Alber (1995) Protein Sci. 4, 1596). For 
instance, most coiled-coil sequences are dominated by a 7-residue repeat of hydrophobic (H) 
and polar (P) residues, (HPPHPPP)*, known as the "heptad repeat". When configured into an 
a-helix this pattern gives an amphipathic structure, the hydrophobic face of which directs 
oligomer-assembly. Furthermore, both the number and the direction of chains within a 
coiled-coil bundle is determined predominantly by residues that form or flank the hydrophobic 
core namely, residues at the first, fourth, fifth and seventh positions of the heptad repeat. For 
instance, coiled coils which form dimers (i.e. two-stranded assemblies) usually have 
isoleucine or valine residues at the first position and a leucine residue at the fourth position. 
By contrast, coiled coils that form trimers (i.e. three-stranded assemblies) often have the same 
residues (i.e both isoleucine or both leucine) at both **H" positions. Finally, hetero-oligomers 
(that is coiled coils made from strands with different amino-acid sequences) may be directed 
by complementary charged interactions that flank the hydrophobic core. For these reasons, 
there have been a number of successful de novo protein designs based on the coiled coil 
These include some ambitious structures that extend the natural repertoire of coiled-coil 
motifs (S. Nautiyal et al (1995) Biochemistry 34, 11645; A. Lombardi et al (1996) 
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Biopolymers 40, 495; D. H. Lee e/ a/ (1996) Nan/re 382, 525; P. B. Harbury et al (1998) 
Science 282, 1462; J. P. Schneider et al (1998) Folding Des. 3, R29). 

In addition to commonly accepted structures with a single, contiguous heptad repeat, the 
inventors have identified sequences with multiple, offset heptad repeats which help explain 
oligomer-state specification in coiled coils. For example, sequences with two heptad repeats 
offset by two residues; i.e a/f-b/g'-c/a'-d/b'-e/c'-f/d'-g/e' set up two hydrophobic seams on 
opposite sides of the helix formed. Such helices may combine to bury these hydrophobic 
surfaces in two different ways and form two distinct structures: open "a-sheets" and closed 
"ot-cylinders". 

Other relevant aspects of coiled-coil structure are described in W099/11774, the disclosure of 
which is incorporated herein by way of reference. 

This understanding of coiled coils, and the resulting protein designs, centres on short 
structures as exemplified by the leucine-zipper motifs (E. K. O'Shea et al (1989) Science 243, 
538; E. K. O'Shea et al (1991) Science 254, 539), which are found in a variety of transcription 
factors. In contrast, most natural coiled coils extend over hundreds of amino acids (A. Lupas 
(1996) supra; J. Sodek et al (1972) Proc. Natl Acad. ScL U.S.A 69, 3800) and many 
assemble further to form thicker, multi-stranded filaments (H. Herrmann and U. Aebi (1998) 
Curr. Opin. Struct Biol. 8, 177). 

With the goal of making elongated structures to improve our understanding of coiled coils, 
and to develop protein-design studies, we initially designed two 28-residue peptides — 
dubbed Self- Assembling Fibre peptides, SAF-pl and SAF-p2 — to fold and form extended 
fibres when mixed. Focusing on the buried, hydrophobic-core positions of the structure, rules 
were incorporated to direct parallel dimer formation and to guard against alternative 
oligomers and topologies (P. B. Harbury et al (1993) supra; D. N. Woolfson and T. Alber 
(1995) supra; L. J. Gonzalez et al (1996) Nature Struct. Biol. 3, 1011). The building block of 
the design was a staggered heterodimer with overhanging or "sticky" ends. This contrasts 
with and distinguishes it from the natural and designer coiled-coil assemblies that have been 
characterized to date, in which the polypeptide strands align in-register, i.e they have blunt or 
"flush" ends. Complementary core interactions and flanking ion-pairs were incorporated into 
the overhangs to facilitate longitudinal association of the heterodimers (Figs. 1&2). This 
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principle of using "sticky ends" is well developed in molecular biology for assembling DNA 
(S. J. Palmer et al (1998) Nucleic Acids Res. 26, 2560), and has been used to design intricate 
DNA crystals (E. Winfiree et al (1998) Nature 394, 539). However, to our knowledge, our 
application of sticky end-directed molecular assembly to peptides is new; although we do note 
that head-to-tail packing of helices has been observed in recently solved crystal structures for 
two designer peptides (N. L. Ogihara et al (1997) Protein Set 6, 80; G. G. Prive et al (1999) 
Protein Set 8, 1400). These were helical peptides that crystallised with their helical ends in 
contact so as to form .pseudo-continuous heEces in the solid state. In other words they formed 
"blunt-ended" arrangements. 

US-A-5/712, 366 discloses self-assembling protein material but does not provide details of 
how to make a staggered parallel heterodimer. WO 96/11947 discloses protein nanostmctures . 
based on bacteriophage T4 tail fiber proteins but does not disclose a staggered parallel 
heterodimer coiled coil structure. 

Pandya et al., Biochemistry, 29, 8728-34, 2000 (published after the priority date of the present 
application) does not disclose a method of making nanotubes and does not disclose a matrix 
comprising the protein structures of the present invention- 
According to one aspect of the invention there is provided a protein structure comprising a 
plurality of first peptide monomer units arranged in a first strand and a plurality of second 
peptide monomer units arranged in a second strand, the strands preferably forming a coiled- 
coil structure, and in which a first peptide monomer unit in the first strand extends beyond a 
corresponding second peptide monomer unit in the second strand in the direction of the 
strands. The protein structures of the invention have numerous advantages. For example, 
relatively long protein fibres can be formed with little material - 1 pi of a: 100 pM solution of 
the peptide monomers may provide enough material to form 10 m of fibre 50 ran thick. 

At least one charged amino acid residue of the first peptide monomer unit may be arranged to 
attract an oppositely-charged amino acid., residue of the second peptide monomer unit. 
Preferably, the charged amino acid residue is -in an end portion of the first peptide monomer 
unit, which extends beyond the corresponding second peptide monomer unit in the second 
strand. At least one strand may consist solely of first or second peptide monomer units 
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respectively i.e homogenous strands. Heterologous strands are also contemplated. The 
peptide monomer units may comprise a repeating structural unit. Preferably, the repealing 
structural unit comprises a hep tad repeat motif, having the pattern: 

hpphppp 
abcdefg 

Preferably, the repeat may include isoleucine or asparagine at position a and leucine at 
position d. Other repeats (e.g hendecads - abcdefghijk) and amino acid compositions may 
also be used (see W099/1 1 774). 
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Preferably, the heptad repeat comprises oppositely-charged residues at positions e and g 
respectively. The oppositely-charged residues may be, for example, glutamic acid and lysine 
residues or arginine and aspartic acid. The use of synthetic amino acids, such as ornithine is 
also envisaged. 

A protein structure in accordance with the invention may be also specified by pairs of 
asparagine residues in the "a" positions provided by corresponding first and second peptide 
monomer units. 

In a preferred protein structure, the first and second peptide monomer units have the following 
sequences: 

t 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC)and 

i 

b) KI RALKA.KNAHLKQE I AALEQE I AALEQ (SAP-p2D) respectively; or 

? 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-pl A) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ ( SAF-p2C) respectively; or 

1 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC) and 

i 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C) respectively. 

It will be appreciated that these are examples only of 4-heptad structures and that other 
lengths are possible and envisaged for use in the invention. 

According to another aspect of the invention, there is provided a method of producing protein 
structures, the method comprising providing a mixture of first and second peptide monomer 
units which associate to form a protein structure according to the invention. The structure can 
be derivatised and/or stabilized by cross-linking. 

Derivatization of the peptide monomer units before or after assembly into the protein 
structures of the invention may be performed. For example, fluorescent moieties 
(fluorophores) may be attached to the coiled coil as described in W099/1 1774. The addition 
of fluorescent moieties may assist visualization of the protein structure. Substitution with 
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anctional groups at the "f position in the heptad repeat is especially preferred as that 
position is on the outside of the helix (see Fig. 1C and IE). Other derivatives may include 
attaching binders to the peptide monomer units for example so that units which can bind other 
entities can be produced. 

The first and second peptide monomers and the strands may have the characteristics described 
above. 

The invention also provides protein fibres produced by an association of protein structures 
according to the invention. 

The protein structures may also be arranged to form tubular structures. In particular, the 
structures may be arranged to form nanotubes. 

According to another aspect of the invention, there is provided a kit for making protein 
structures, the kit comprising first and second peptide monomer units which associate to form 
a protein structure or protein fibres according to the invention. 

The protein structures of the invention may be assembled in two and three dimensional arrays. 
For example, two dimensional mats can be formed which can function, for example as filters. 
Three dimensional grids or matrices can also be formed again, for example, for use as sieves 
or filters or for organising other associated or conjugated molecules in three dimensions. 

In a preferred embodiment, a matrix is assembled in situ. For example, a matrix can be 
formed in a solution to entrap contaminants in the solution and then the matrix, together with 
contaminants, can be removed from the solution for example by centrifugation. 

The stability of the protein structures at higher temperatures may be improved by making the 
peptide monomers longer, such that the overlap between corresponding first and second 
monomer unit residues is increased. Increases in monomer length have previously been 
shown to stabilize coiled coil structures. Alternatively, stability can be improved by 
introducing bonding between adjacent peptide monomer units in the same strand. For 
example, Kent (Dawson et al (1994) Science 266: 776) and co-workers have produced peptide 
bonds between adjacent polypeptide units by coupling and subsequent rearrangement of a 
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cysteine residue at the N end of one polypeptide unit to a thio-ester derivatised C-terminus of 
another unit. 

Additionally, the protein structures may be stabilised and derivatised by using them to 
template the polymerisation of synthetic polymers. 

Definitions 

The terms used in the specification are to be given the ordinary meaning attributed to them by 
the skilled addressee. The following is given by way of clarification: 

Amino acid. 

This term embraces both naturally-occuring amino acids and synthetic amino acids as well as 
naturally-occuring amino acids which have been modified in some way to alter certain 
properties such as charge. In all cases references to naturally-occurring amino acids may be 
considered to include synthetic amino acids which may be substituted therefor. 

Coiled Coil 

A coiled-coil is a peptide/protein sequence usually with a contiguous pattern of hydrophobic 
residues spaced 3 and 4 residues apart, which assembles (folds) to form a multi-meric bundle 
of helices. Coiled-coils including sequences with multiple offset repeats are also 
contemplated. 

Dimer 

A dimer is a two stranded structure. 
Heterodimer 

A heterodimer is a dimeric structure formed by two different stands. 
Staggered heterodimer 

A staggered heterodimer is a structure in which the two strands assemble to leave overlapping 
ends that are not interacting within the heterodimer. 
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ant-end assembly 

Blunt-end assembly is association where the two strands combine to give flushed i.e 
non-overlapping ends. 

Protofibril 

A protofibril is a protein structure assembled longitudinally from staggered heterodimers 
interacting through their overhanging ends. 

Fibre 

A fibre is a structure formed by lateral association of two or more protofibrils. 

Protein structures and methods of producing protein structures in accordance with the 
invention will now be described, by way of example only, with reference to the accompanying 
Figures 1 to 8 in which: 

Fig. 1 illustrates the design and the sequences of self-assembling fibre (SAF) peptide 
monomers of the invention. 

Fig. 2 illustrates computer modelling of the designed self- assembling fibre of the invention. 

Fig. 3 illustrates the results of circular dichroism (CD) and linear dichroism (LD) experiments 
on protein structures of the invention. 

Fig. 4 illustrates the assembly of synthetic protein fibres visualized directly by transmission 
electron microscopy and an analysis of fibre width In all panels, the white scale bars represent 
100 nm. Fig. 4D is a histogram showing the distribution of fibre widths determined using 
TEM for fresh (white bars) and matured (black bars) mixtures of SAF peptides at 100 \xM (a 
width value of "jc" on the histogram includes all measurements made from "(x—5) to x"). 

Fig. 5 is a cartoon showing the possible anti-typic association of parallel helical peptides 
leading to a homo-oligomeric peptide nanotube. 

Fig. 6 is an x-ray diffraction pattern of an aligned protein fibre of the invention. 
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peptides at 100 jiM (a width value of 'V on the histogram includes all measurements made 



from %v-5) to x J ). 

Fis^- 5 is a cartoon showing the possible anti-typic association of parallel helical peptides 
leading to a homo-oligomeric peptide nanotube. 

Fig. 6 is an x-ray diffraction pattern of an aligned protein fibre of the invention. 

Fig. 7 is an image from a confocal fluorescent microscope showing fibres which have been 
derivatised through the inclusion of flurophores; and 

Fig. S shows amino acid sequences designed to form blunt-ended heterodimers. 
1) Peptide Design and Synthesis 

Various peptide monomer units were designed as described above. The monomers and 
capping peptides (designed to complement the sticky ends of the monomers so as to produce 
flush, or blunt ends and, so, arrest longitudinal fibre assembly) are set out in Table 1 : 



Printed: 10-05-2001 
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Fig. 1 shows (A) A mechanism for self-assembly: complementary charges in "companio 
peptides direct the formation of staggered, parallel heterodimers; the resulting "sticky" en 
are also complementary and promote longitudinal association into extended structures. F 
1(B) shows the designed amino acid sequences: each peptide comprised canonical hept; 
repeats (abcdefg) with He at a and Leu at d to guide the formation of coiled-coil dimei 
oppositely-charged residues were incorporated at e and g to favour the staggered dimer wi 
sticky ends; asparagine residues (which preferentially pairs with each other at a sit< 
(Gonzalez L et al (1996) Nature Structural Biology 3, 13: 1011-1018) were included ■ 
cement the prescribed register further and to favour the parallel structures. Fig. 1(C) is 
helical-wheel representation, summarizing the designed sequences in context. The view 
from the N-terminus with heptad sites labeled a-g and assumes 3.5 residues per helical turn t 
emphasise the heptad repeat. 

The peptides were synthesized on an Applied Biosystems 432A Peptide Synthesizer usin 
solid-phase methods and Fmoc chemistry. Peptide samples were purified usin. 
reversed-phase HPLC and their identities confirmed by MALDI-TOF mass spectrometry. 

Various combinations of peptide monomers and capping peptides were tested as set out ii 



Table 2: 
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In addition and as a control, the SAF-plc sequence was permuted (N- and C-terminal halves 
were swapped) to produce peptide SAF-p3: 

E I DALE YE NDALEQK IAALKQK IASLKQ 

This design should combine with S AF-p2D to form a blunt-ended structure, which should not 
form fibres. 

2) Modeling of Protein Fibre Structure 

A model of the three-dimensional structure of the designed protein fibre resulting from the 
assembly of SAF-pl and SAF-p2 was made from the minimised structure of a model 
coiled-coil 35-mer, (LAALAAA) 5 , which was generated using Crick's Equation and had an 
ideally packed interface (G. Offer and R. Sessions, J. Mol. Biol 249, 967 (1995)). Copies of 
the 35-mer were superimposed with an overlap of one heptad repeat to extend the structural 
template, and the backbone was rejoined after removal of overlapping segments. Residues in 
the two-stranded template were replaced with the sequences of the SAF peptides, staggered 
relative to each other by two heptad repeats according to the alignment in Fig. IB. The 
structure was soaked in a 5 A layer of water and energy minimised until the average absolute 
derivative of coordinates with respect to energy fell below 0.01 kcal A" 1 . The structure was 
built and visualized using Insight TI 97.0 (Molecular Simulations Inc.), and was 
energy-minimized using Discover 2.9.8 (Molecular Simulations Inc.) with the consistent 
valence forcefield. In Fig 2(A) peptides SAF-pl and SAF-p2 (each coloured dark 
grey-to-light grey from the N-terminus) interact through core residues including asparagine 
pairs (coloured mid-grey) to form the two strands of a staggered, parallel, coiled-coil fibre. In 
Fig. 2(B), negatively charged glutamate side chains (coloured light grey) and positively 
charged lysine side chains (coloured black) form complementary charge interactions between 
the SAF peptides. 



3) Circular Dichroism Experiments 

Peptide samples were incubated at 5°C in 10 mM MOPS (3-(N-Morpholino)propanesulfonic 
acid), pH 7. Sample concentrations were determined from their UV absorbance at 280 nm 
(SAF-pl) and 214 nm (SAF-p2). After baseline correction, ellipticities in mdeg were 
converted to molar ellipticities (deg cm 2 dmol-res 1 ) by normalizing for the concentration of 
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peptide bonds. Data were recorded in a cell of 1 mm path length by integrating the signal for 
5s (and Is for the fresh 100 pM peptide mixture) every nm in the range 205-260 nm. CD 
measurements were made using a JASCO J-715 spectropolarimeter fitted with a Peltier 
temperature controller. 

The CD data shown in Fig. 3 provides spectroscopic evidence for the formation of helical 
structures by the SAF peptides. Fig. 3(A) shows circular dichroism (CD) spectra at 10 pM 

for: SAF-pl (- ), SAF-p2 (- -), the average of these spectra ( ), and the experimental SAF 

peptide mixture (O). Fig 3(B) shows CD spectra at 100 pM - the key is the same as for Fig 
3(A), but with the additional spectrum (•) being for the SAF peptide mixture after 
"maturation" for 1 h. 

Consistent with our design, neither SAF-pl nor SAF-p2 was highly structured in aqueous 
solution at pH 7 and 5 °C (Fig. 3). However, when mixed in equal proportions the circular 
dichroism (CD) spectrum changed and, moreover, was markedly different from the 
theoretical spectrum generated by averaging the spectra for the isolated peptides. In 
particular, the spectrum for the mixture had intense minima at 208 and 222 nm consistent 
with the formation of a-helical structure, but these features were not as pronounced in the 
spectra of the individual peptides. This was clear evidence that the two peptides interacted to 
form an a-helical structure as designed. Furthermore, and as expected for a multimerization 
event, the magnitude of these spectral changes depended on peptide concentration; a SAF 
mixture with 10 pM of each peptide, did show a weak signal indicative of some a-helical 
structure, however, a 100 pM mixture gave a much stronger signal (Figs. 3A&B). 

The shape and intensity of spectra from 100 pM mixtures of the SAF peptides also changed 
with time (Fig. 3B). Spectra recorded immediately after mixing a "fresh" sample displayed 
some a-helical structure. After incubation of the mixture for 1 hour at 5 °C ("maturation"), 
however, the signal at 222 nm was more intense, and indicated approximately 75 % a-helix, 
consistent with substantial coiled-coil formation. 

Maturation of 100 pM SAF peptide mixtures was also accompanied by slight clouding of the 
samples. Scattering effects from such samples can lead to attenuation and distortion of CD 
spectra (D. Mao and B. A. Wallace, (1984) Biochemistry 23, 2667). However, we could 
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disregard this possibility because altering the distance between the sample and the detector in 
the CD instrument did not affect the shape or the intensity of the spectrum. Furthermore, we 
established that the majority of the CD signal from the mixtures derived from the suspended 
material: a supernatant without the suspended material, which was recovered by 
centrifugation of a matured 100 jiM SAF mixture, gave only a weak CD signal similar to the 
10 jiM mixture. 

Thus, the CD data were wholly consistent with the desired a-helical SAF design and, 
moreover, indicated the formation of large assemblies. 

As a control, SAF-p3 (the permutation of SAF-pl (identical to SAF-plc)) was designed to 
form a blunt-ended heterodimer with SAF-pl that should not assemble further into fibres. 
100 nM mixtures of SAF-p2 (identical to SAF-p2D) and SAF-p3 were analysed by 
sedimentation equilibrium in the analytical ultracentrifuge. The resulting data were best 
fitted assuming a single ideal species in solution, and the molecular weight was allowed to 
vary during the fit. An M r of 6422 (with 95% confidence limits of 5924 and 6911) was 
obtained, which is very close to the expected heterodimer value of 6303 calculated from mass 
spectrometry of the individual peptides. CD spectra for 100 (iM fibre-producing mixtures 
(SAF-pl with SAF-p2), and for blunt dimer-producing mixtures (SAF-p2 with SAF-p3), were 
recorded. For the blunt dimer-producing mixtures, the shape and intensity of the CD 
spectrum were fully consistent with coiled-coil formation as designed. In contrast to the 
fibre-producing mixtures, the blunt dimer-producing mixtures showed no signs of maturation; 
that is, negligible spectral changes and no clouding of solutions occurred upon incubation. 
Interestingly, the intensity of the minimum near 222 nm, which is an accepted indicator of 
a-helical structure and degree of a-helical folding, was similar for both mixtures. This 
strongly supports the formation of a-helical structure as designed in the fibre-producing 
mixtures despite the spectral shifts observed upon maturation. 

4) Linear Dichroism Experiments 

Linear dichroism (LD) spectroscopy was also used to test if elongated structures were being 
formed as designed. Long polymers such as DNA molecules can be oriented by shear flow. 
This effect can be monitored by LD spectroscopy provided that chromophores also become 
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aligned by the flow (M. Bloemendal (1994) Chem. Soc. Rev. 23, 265; A. Rodger and B. 
Norden (1997) Oxford Chemistry Masters (Oxford University Press, Oxford), vol. 1). 

Peptide samples were prepared for LD as for CD. LD data were collected on samples 
spinning in a couette flow cell by integrating the signal for 2 s every nm in the range 210-320 
nm, using a JASCO J-715 spectropolarimeter. After baseline correction, absorbance was 
converted to molar extinction coefficient (1 mol-res 1 cm* 1 ) by normalizing for the 
concentration of peptide bonds. A linear correction for a sloping baseline was made to the 
data from the 100 nM SAF peptide mixture. 

The results are depicted in Fig. 3D, which shows linear dichroism (LD) spectra for: 20 

tropomyosin ( ), the SAF peptide mixture at 10 \xM ( — ), and the SAF peptide mixture at 

100 \xM in the absence (•) and presence (O) of 0.5 M KF. 

For instance, we found that tropomyosin, which forms a dimeric coiled coil approximately 42 
nm in length, could be aligned to give a LD signal (Fig. 3D). In contrast and consistent with 
our design and the CD data, experiments with a 10 \iM SAF mixture, (Fig. 3D), and for the 
individual peptides at 100 (data not shown), LD signals were not detected. However, a 
matured 100 jaM SAF peptide mixture gave a strong absorbance from the peptide backbone 
(210-240 nm) and some signal in the aromatic region (260-290 nm) during flow orientation 
(Fig. 3D). As only long structures are aligned by this technique, the data demonstrated that 
long fibres at least 500 run in length were present in solutions of the matured 100 SAF 
peptide mixtures. 



5) Electron Microscopy 

To confirm fibre assembly, we used electron microscopy to visualize structures in the peptide 
preparations directly. For TEM experiments, peptide samples were incubated for 1 h at 5 °C 
in filtered 10 mM MOPS, pH 7. A drop of peptide solution was applied to a carbon-coated 
copper specimen grid (Agar Scientific Ltd, Stansted, UK), and dried with filter paper before 
negative staining with 0.5% aqueous uranyl acetate and then dried at 5 °C. A "fresh" SAF 
peptide mixture was prepared by mixing preincubated solutions of the individual peptides at 
200 ^M directly on the specimen grid, before drying and negative staining as described. 
Grids were examined in a Hitachi 7100 TEM at 100 kV and digital images were acquired 
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with a (800 x 1200 pixel) charge-coupled device camera (Digital Pixel Co. Ltd., Brighton, 
UK) and analyzed (Kinetic Imaging Ltd., Liverpool, UK). 

For scanning electron microscopy (SEM) experiments, negatively-stained specimen grids 
were sputter-coated with gold and examined in a Leo Stereoscan 420 SEM at 20 kV and with 
a probe current of 10 pA. 

No structures were visible up to 100 000 times magnification by transmission electron 
microscopy (TEM) for either the 10 jliM SAF mixture, or for the individual peptides at 100 
fiM concentration (data not shown). However, TEM of a 100 jiM SAF mixture at 50 000 
times magnification revealed time-dependent formation of long fibrous structures, consistent 
with the CD and LD data. Fresh mixtures showed large numbers of extended fibres of 
various widths. The majority of these had a diameter of about 20 nm (Figs. 4A (a fresh 
mixture at 100 jiM) & Fig 4D); finer fibres were present, but their widths could not be 
measured reliably. Images recorded for the matured mixtures showed fewer fibres, but these 
were more distinct and thicker than those observed in the fresh mixture (Fig. 4B&D). 

Scanning electron microscopy (SEM) of a matured mixture showed no evidence for fibre 
branching. Rather, the fibres were simply intertwined as if layered on top of each other (Fig. 
4C). It was not possible to follow the full length of fibres due to intertwining, but they were 
at least several hundred microns in length. Although the density of fibres varied across the 
surface of the EM grid, for the matured samples at least, their diameters were quite uniform 
with a mean width of 43.3 (SD = 9.3) nm (Fig. 4D). As the original design was for a 
longitudinally extended, but otherwise two-stranded coiled coil the average diameter that we 
might have expected was about 2 nm. Therefore, the EM data suggested that the designed 
two-stranded coiled-coil fibres associate laterally into higher order assemblies. 



6) X-ray Fibre Diffraction 

Mixtures of SAF peptides at 500 jiM in 10 mM MOPS, pH 7, were incubated on ice for at 
least lh, before centrifugation at 6500g for 5 mim Droplets of fibre-containing solutions, 
taken from the bottom of the centrifuged tubes, were suspended between the ends of two 
wax-filled capillaries and allowed to dry slowly overnight at 4°C, yielding clumps of partially 
aligned fibres. X-ray fibre diffraction i ges were collected using a Rigaku CuKa rotating 
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anode source (wavelength 1.5418 A) and a R-AXIS IV detector. Samples were maintained at 
5°C during data collection with cool air from a cryostream (Oxford Cryo-systems). The X-ray 
fibre diffraction pattern collected from SAF peptide fibres showed the following features 
(Fig. 6): (1) a short meridional (that is, parallel to the long fibre axis) reflection at 5.1 1± 0.03 
A; (2) the harmonic of this 5.1 1 reflection at 10.19 ± 0.05 A; and (3) a stronger, more diffuse 
reflection centered at 8.8 ± 0.15 A on the equator. These features are consistent with 
a-helical coiled-coils aligned with the fibre axis. The 5.1 A meridional reflection 
corresponds to the pitch of the helices within the coiled-coils. The other expected reflection 
on the meridian-that is, that at 1 .5 A and corresponding to the rise per residue-lies out of the 
resolution of the current data sets, whereas the equatorial reflection reveals the mean distance 
between a-helical axes. This value at 8.8 A is less than the observed value for keratin but 
falls within reported ranges for dimeric coiled-coil peptides. 



7) Effect of Potassium Fluoride on Protein Fibre Assembly 



Molecular modeling of the SAF sequences into an extended two-stranded coiled coil also 
highlighted potential complementary charge interactions on the surface of the protofibrils, 
Figs 1&2. In accordance with this, experimentally it was found that moderate concentrations 
of salt inhibited protofibril and thick fibre assembly. First, CD spectra recorded for both the 
individual peptides and a 100 mixture of SAF peptide samples with 0.5 M potassium 
fluoride showed reduced helical CD signals and there was no evidence of "maturing" in the 
mixed samples (Fig. 3C). Second, the LD signal described previously for the matured 100 
^iM SAF peptide mixture was also lost when the experiment was repeated in the presence of 
salt (Fig. 3D). Finally, TEM images of a 100 jiM SAF mixture also demonstrated that fibres 
were not formed in the presence of 0.5 M KF (Fig. 4E). Fig. 4E shows the results of TEM of 
a matured SAF peptide mixture at 100 p.M incubated in the presence of 0.5 M KF. 
The inventors did not knowingly design any features into the SAF peptides to foster further 
association of the two-stranded coiled coils. The observation of thick fibres in SAF peptide 
preparations, therefore, raised the question: what interactions guided and stabilized these 
higher-order assemblies? The inventors therefore propose that features inherent in repeating 
structures of the type that they designed will naturally promote such fibre assembly 
(fibrillogenesis). 
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Consider a protofibril as depicted in Fig. IB and 2A. Any sequence feature presented on its 
surface by either, or both of the constituent peptides will be repeated at regular intervals along 
the protofibril. The repeat length will be equal to the length of the peptides (for SAF-pl and 
SAF-p2 this was 28 residues, or about 4.2 nm). Furthermore, the motif will spiral around the 
protofibril tracking the superhelix of the coiled coil, which has a pitch of about 15 nm for a 
contiguous, heptad-based, dimeric structure. In this scenario, protofibril-protofibril 
interactions may be promoted if another sequence motif complementary to the first is present 
in the potential partner. This is because the pitches of the complementary motifs on each 
protofibril will match precisely. Thus, once initiated, lateral association of protofibrils — 
that is, fibrillogenesis — will be cemented by many regularly spaced interactions as in a 
crystal. As a result, the complementary interactions need only be weak as the stability of the 
protofibril-protofibril interaction rests on an avidity effect rather than a small number of 
strong interactions. Provided that the components of the assembly can make more than one 
type of complementary surface very extensive molecular assemblies may result. 
The inventors used electrostatic interactions both to direct heterodimer formation, and to 
promote elongation of the protofibrils (Figs. 1 and 2). These features would also create 
periodic and alternating patches of charge in the protofibrils provided they are regular as 
envisaged (Fig. IB and 2B). These charged patches could guide and stabilize the higher order 
assemblies. Indeed, similar features have been noted in several natural fibrous proteins and 
have been implicated in the assembly of multi-protein filaments (J. J. Meng et al (1994) Biol. 
Chem. 269, 18679; A. D. McLachlan and M. Stewart (1976) MoL Biol. 103, 271), and small 
synthetic peptide systems (S. G. Zhang et al (1993) Proc. Natl. Sc. U.S.A 90, 3334). The 
experiments with salt (KF) described above suggest that salt-bridges (electrostatic interaction) 
may be at least in part the cause of fibrillogenesis. 



8) Coiled-coils design 

a. For two superimposed heptads there are three possible sequence offsets of 1, 2 and 3 
residue(s), which are equivalent to 6, 5 and 4-residue offsets, respectively. For a regular 
3.6-residue-per-tum a-helix, these set up two hydrophobic faces with angular offsets of 
100°, 160° (360-200) and 60° (360-300), respectively, around the outside of the helix. 
This is best seen on a helical wheel. Accounting for helical supercoiling - i.e assuming 
3.5 residues per turn and using the accepted helical-wheel representation for the 
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coiled-coil these angular offsets are altered to 103°, 154° and 51°, respectively. However, 
both sets of angles are over-simplifications when considering helix-helix interactions in 
actual coiled-coil systems because side-chain size, geometry and packing also affect the 
helix interfaces (Harbury, P. B. et al (1993) Science 262, 1401-1407; Harbury, P. B. et al 
(1994) Nature 371, 80-83; Malashkevich, V. N. et al (1996) Science 274, 761-765). 
Nonetheless, we found that many natural coiled-coil assemblies, at least, were consistent 
with the approximate angular offsets: Trimers could be considered as having overlapping 
heptads separated by 3 residues (angular offset = 51/60°). Whereas, tetrameric and 
pentameric coiled-coils were often variations on a theme with two hepad repeats offset by 
1 residue (100/103°). 



b. Two heptad repeats offset by two residues: a-cylinder constructions 

Sequence offsets of 2 residues are potentially more interesting than the 1- and 3-residue 
offsets. This is because of the possibility of placing hydrophobic (H) residues at a, c, d, 
and f, with c and f effectively making up the a' and d' positions of the second, offset 
heptad. This is represented below, where P signifies polar (non-core) residues. 

abcde fgabcdefg repeat 1 

HPPHPPPHPPHPPP binarypattem 1 

PPHPPHPPPHPPHP binary pattern 2 

f 'g'a'b'c'd'e'f ■g'a'b'c'd'e' repeat 2 



abcdefgabcdefg assigned register 

HPHHPHPHPHHPHP overall binary pattern 

Such sequence patterns would results in two hydrophobic seams with a wide angular 
separation (154/160°), which would place them roughly on opposite sides of the helix. 
Furthermore, it offers two possibilities for parallel helix-helix packing arrangements; syn, 
where two like faces - i.e a / d with a / d , or c/f with c/f - from neighbouring helices 
combine to produce an openct-sheet, Fig. 6a; anti, where a/d faces pair with c/f. In the 
anti-arrangement the structure can close to form a a-cylinder. For antiparallel pairs of 
helices syn-typic association should lead to cylinders, whereas sheets should be formed 
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from anti-typic antiparallel interfaces. 



A natural cc-cylinder 

TolC has two oc-barrel-like domains (Koronakis, V. et al (2000) Nature 405, 914-919). 
Both have 12 helices contributed by 3 monomers. In the lower barrel each helix pairs 
with another from the same protomer to form separate supercoiled, antiparallel 
coiled-coils; SOCKET analysis revealed extensive antiparallel knobs into holes (KIH) 
interactions within these pairs, but not between them. In contrast, the helices of the upper 
barrel appear to pack more uniformly, albeit with a slant, to describe an a-cylinder. The 
SOCKET output for this part of the structure revealed many fewer KIH interactions than 
found in the lower barrel. Furthermore, KIH interactions were not contiguous around the 
cylinder and, in particular, they were more extensive between helices in the same 
monomer, but less regular between the helices abutting the monomers. In our view, the 
TolC barrel represents a variation of the cylinders formed by protein structures of the 
invention. 



Nevertheless, the inventors were able to assign heptad registers for the helices of the 
upper barrel unambiguously. This revealed knobs at relative a, c, d, and/ positions and 
syn-typic association of two seams adjacent helices; i.e fully consistent with the theory 
outlined above. 



We believe that it will be possible to constuct oc-sheets and oc-cylinders using helices in 
parallel. The use of parallel helices does have one interesting consequence for the 
construction of a-cylinders, however: as the pairing in these structures will be anti-typic, 
a residues on one helix partner c residues of a neighbouring helix at the same level in the 
structure. Similarly, d and / residues pair at the intervening levels. The result will be that 
successive helices will be translated up the helix and cylinder axes by two residues, which 
is equivalent to ~3A. Thus, attempts to construct a-cylinders from parallel helices will 
give spirals of helices which may or may not close. This is, however, potentially 
extremely interesting as it opens up possibilities for making peptide-based nanotubes as 
described above. 
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A second consideration for a-cylinder construction is the consequences of helix and 
coiled-coil supercoiling. The upper barrel of TolC has 12 helices. Based on a structure of 
parallel helices with canonical supercoiling, i.e an angular separation of 154° between the 
two seams in each helix, we calculated that the cylinder should close at 14 helices. 
However, variations in helix number are expected. One reason for this is that helices 
cannot supercoil in two direction simultaneously, and some distortion is required to 
maintain packing at both interfaces. We found structural precedents for this in the Protein 
Data Book PDB where tight knobs-into-holes packing was maintained (Walshaw & 
Woolfson, unpublished); indeed, the central helices of the 3 -helix a-sheets are straight, 
Fig. 7b. (n.b . The slanting of the helices in the upper barrel of TolC may offer a 
compromise between straight and supercoiled helices). Assuming the packing of 
completely straight helices, the angular offset becomes 160° and 18 helices would close a 
cylinder. However, given that, as in 3-, 4- and 5 -stranded coiled coils, side chains 
mediate the helix-helix contact angles other oligonmerisation states might be possible 
(Harbury, P. B et al (1993) Science 262, 1401-1407; Harbury, P. B. et al (1994) 371, 
80-S3; Malashkevich, V. N. et al (1996) Science 274, 761-765): we calculate that small 
adjustments in the angular offset between 144° to 162° varies the helix number from 10 to 
20. 

9) Formation of Protein Structures 

As mentioned above, the protein structures of the invention may have various applications 
such as in: 



Nanotubes 

a. This can be achieved for example by combining the aforementioned 7- and 11 -residue 
repeats with offsets in the sequence. The effect would be eliminate the overall 
hydrophobic displacement. In other words, alternating heptad and hendecad repeats give 
an 18-residue repeat to match the a-helical repeat; in the a-helix, 18 residues span 5 
helical turns exactly. It may therefore be possible to create a completely closed peptide 
nantotube (Fig. 5 shows part of a nanotube) In the parallel, straight helix case there would 
be 18 helices per turn of the "cylinder", and the rise per turn is 36 residues. Thus, a 
36-residue peptide with a 7-11-7-11 repeat offset by 2 residues should form a spiral of 
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helices the ends of which meet to close the tube. Such nanotubes maybe particularly 
useful in the production of nanoscale piping and plumbing. The interior of the tube may 
be derivatised to control the flow of different small (2-40A) molecules. 



. Derivitised and branched peptides and peptide templates 

The self-assembling peptides of the invention are relatively small and synthetically 
accessible. Thus, non-standard derivatisable side chains may be incorporated in them. 
For example, the monomer units can be made with a single cysteine residue at an exterior 
f position. These can be used to couple small molecules and other peptides using 
thiol-based chemistry. A wide variety of thiol-reactive probes are available. In particular, 
the peptides can be tagged with fluorophores. For instance, with one peptide labelled 
with Fluorescein and the other with Rhodamine fibres visualised by confocal microscopy 
appear green and red, respectively (Fig. 7). There is a possibility for FRET between the 
probes, which may pack closely in the fibres, and this may confuse interpretation. To 
avoid this the tagged peptides can be doped into fresh, assembling SAF mixtures. Having 
available fluorescently labelled peptides and fibres offers another route to tracking 
fibre/network assembly and orientation. 



To generate branched self-assembling fibres "T-shaped" conjugated peptides can be 
made. These are covalent heterodimers made by mixing and coupling together variants of 
two SAF peptides: one with a terminal cysteine and the other having a central cysteine 
residue. The desired products can be purified from the mix of disulphide-linked peptide 
by PHLC. Doping the conjugated ("T") peptides into fresh SAF mixtures should 
propagate fibre assembly in three dimensions as both the "bar" and the "stem" of the "T" 
could become incorporated in, or initiate, fibres. The resulting networks can be visualised 
and characterised by EM. 



Peptide synthetic diblock copolymer hybrids may be produced. Suitable methods for 
preparing water soluble diblock copolymers using atom transfer radical polymerisation 
are described in X. S. Wang et al Chemical Communications 1817 (1999) and X. S Wang 
et al Macromolecules 33, 257 (2000). 

The protein fibres of the invention may be used to template and control this 
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polymerisation either to produce hybrid fibres or if the peptide template is subsequently 
disassembled and marked away, to provide routes to water soluble "fishnet" nanotubes. 
Other possibilities include: conjugating polymers onto preassembled peptide fibres; 
conjugating the polymers and peptides prior to fibre assembly; and effecting 
polymerisation on the pre-assembled fibres. 



c. Formation of Matrices 

The protein fibres of the invention may be arranged to form two and three dimensional 
grids and matrices respectively. One application for such matices is in the purification of 
biological fluids such as blood. An affinity matrix could be assembled (for example in 
situ in blood) to remove blood contaminants such as viruses. In the case of virus removal, 
a binder for the target contaminant (e.g a peptide or protein with natural or engineered 
affinities for a viral coat protein) can be fused to a peptide monomer units in the protein 
structure of the invention. The matrix can then be removed from blood along with any 
bound contaminants by light centrifligation. For example, it is estimated that a 100 nm 
length of fibre would have a mass of > 12 MDa which would readily be removed. Such 
affinity matrices have a number of advantages over larger naturally occurring proteins. In 
the assembled matrices any binders are aligned to give high effective avidities for the 
targeted molecules. 



d. Other applications 

Other applications for protein structures in accordance with the invention include: 
L preparation of organised networks for seeding the crystalisation of biomolecules for 
X-ray crystallography; 

ii. using ordered fibres to promote cell growth for tissue engineering; 

iii. the construction of nanoscale molecular sieves 

iv. the preparation of nanoscale molecular grids/scaffolds that could be used as supports 
for a variety of functional small or macromolecules. 

v. fiinctionalised grids and networks could be used in, for example, catalysis, 
affinity-sieving/purification of biological fluids and other research solutions, the 
recruitment of endogenous molecules and co-factors to promote tissue repair and 
tissue engineering in general. 
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vi. to create novel lab-on-chip technologies, peptide self-assembly could be combined 
with lithography as follows. 

Lithography and related techniques can be used to pattern a variety of surfaces with 
channels, which can be made of a suitable size (e.g 20-100 nm wide and deep) to 
accommodate peptide fibres. These can then be used to direct the assembly of the 
fibres from solutions mixed directly on the surfaces. Furthermore, using 
well-established chemistry, the inventors envisage funtionalising the peptide fibres 
with a variety of small molecules and other proteins. This proposed combination of 
peptide design, self-assembly and lithography should allow the development of 
ordered arrays of functional polymers on specific surfaces. 

vii. Assembled fibres could also be used as fine (therefore, high resolution) tips in AFM 
(atomic force microscopy) the current limit is about 10-25 nm using carbon 
nanotubes. 
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Claims 

1 . A protein structure comprising a plurality of first peptide monomer units arranged in a 
first strand and a plurality of second peptide monomer units arranged in a second 
strand wherein the first and second monomer units comprise the heptad repeat motif 
(abcdefg) and/or the hendecad repeat motif (abcdefghijk), and wherein a pair of 
asparagines, arginines, lysines or other complementary residues in the "a" position on 
at least one pair of corresponding first and second monomer units ensures that the first 
strand and the second strand form a staggered parallel heterodimer coiled coil 
structure. 

2. A protein structure according to claim 1, wherein a first peptide monomer unit in the 
first strand extends beyond a corresponding second peptide monomer unit in the 
second strand in the direction of the strands, 

3. A protein structure according to any one of claims 1 to 2 in which at least one 
charged amino acid residue of a first peptide monomer unit is arranged to attract an 
oppositely-charged amino acid residue of a second peptide monomer unit. 

4. A protein structure according to claim 3 in which the charged amino acid residue is in 
an end portion of the first peptide monomer unit which extends beyond the 
corresponding second peptide monomer unit in the second strand. 

5. A protein structure according to any one of the preceding claims in which at least one 
strand consists solely of first or second peptide monomer units respectively. 

6. A protein structure according to any one of the preceding claims wherein one or more 
of the other "a 7 ' positions of the first and second monomer units is a hydrophobic 
residue. 
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7. A protein structure according to claim 6, wherein the hydrophobic residue is selected 
from isoleucine or valine. 

8. A protein structure according to any one of the preceding claims having a leucine at 
one or more of the "d" positions of the first and second monomer units. 



9. A protein structure according to any one of the preceding claims having 
oppositely-charged or otherwise complementary residues at positions g and e of 
respective monomer units. 

10. A protein structure according to claim 9 in which the oppositely-charged residues are 
glutamic acid and lysine residues or arginine and aspaitic acid residues, or synthetic 
derivatives of these amino acid residues. 

11. A protein structure according to any preceding claim in which the structure is 
stabilised by pairs of asparagine, arginine, lysine or other complementary residues 
provided by corresponding first and second peptide monomer units. 

12. A protein structure according to any preceding claim which is arranged to form a 
tubular structure. 



13. A protein structure according to claim 12 in which the repeat motifs are offset by two 
or more amino acid positions in sequence whereby the peptide monomer units form a 
cylinder. 

14. A protein structure according to any preceding claim in which the first and second 
peptide monomer units have the sequence. 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl) and 
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KIRALKAKNAHLKQEI AALEQE I AALEQ (SAF-p2) respectively; or 
K I AALKQK1 AALKQE I DALE YENDALEQ (SAF-pl A) and 

d) KIRALKWKNAHLKQE I AALEQE IAALEQ (SAF-p2C) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC) and 

f) KIRALKWKNAHLKQE I AALEQE I AALEQ (SAF-p2C) respectively. 

15. A peptide monomer unit for use in preparing a protein structure the peptide monomer 
unit having an amino acid sequence selected from: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl); 

b) KIRALKAKNAHLKQE I AALEQE IAALEQ (SAF-p2); 
C) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-pl A); 

d) KI RALKWKNAHLKQE I AALEQE IAALEQ (SAF-p2Q and 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC). 

16. A protein structure according to any one of claims 1 to 14 or a peptide monomer unit 
according to claim 15 wherein at least one amino acid residue is derivatised. 

17. A branching self-assembling fibre comprising two or more protein structures 
according to any one of claims 1 to 1 1, coupled together to form a T-shaped 
conjugated structure. 
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The branching self-assembling fibre of claim 17, wherein at least one of the protein 
structures comprises one or more central cysteine residues, and at least one other 
protein structure comprises a terminal cysteine residue. 

A method of producing protein structures, the method comprising providing a mixture 
of first and second monomer units which associate to form a protein structure 
according to any one of claims 1 to 14, wherein the first and second monomer units 
comprise the heptad repeat motif (abcdefg) and/or the hendecad repeat motif 
(abcdefghijk). 

A method according to claim 19 in which the protein structure is derivatisecL 

A method according to claim 19 or 20 in which the protein structure is stabilised by 
cross-linking. 

A protein fibre produced by an association of protein structures according to any one 
of claims 1 to 14. 

A kit for making a protein structure, the kit comprising first and second peptide 
monomer units which associate to form a protein structure according to any one of 
claims 1 to 14 or a protein fibre according to claim 22, wherein the first and second 
monomer units comprise the heptad repeat motif (abcdefg) and/or the hendecad repeat 
motif (abcdefghijk). 

A two dimensional grid comprising a protein structure according to any one of claims 
1 to 14 or a protein fibre according to claim 22. 

A three dimensional matrix comprising a protein structure according to any one of 
claims 1 to 14 or a protein fibre according to claim 22. 

A matrix according to claim 25 which is arranged to assemble in solution. 
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28. 

29. 
30. 



31. 
32. 

33. 
34. 



A matrix according to claim 25 or claim 26, wherein one or more binders is fused to 
the protein structure, wherein the one or more binders are aligned to give high 
avidities for one or more target entities. 

A matrix according to any one of claims 25 to 27 which is arranged to bind one or 
more target entities. 

A matrix according to claim 28 which is arranged to bind viruses. 

A method of forming a matrix according to any one of claims 25 to 29 in which a 
mixture of separate first and second monomer units is provided, wherein the first and 
second monomer units comprise the heptad repeat motif (abcdefg) and/or the 
hendecad repeat motif (abcdefghijk) and are caused to associate to form a plurality of 
protein structures according to any one of claims 1 to 14, wherein the protein 
structures assemble to form a three-dimensional matrix. 

A method according to claim 30 in which the matrix is formed in situ. 

A method for controlling the production of a synthetic polymers comprising 
assembling a protein structure in accordance to any one of claims 1 to 14 in 
association with the polymer. 

A method according to claim 32 in which the protein structure is removed after 
synthesis of the polymer. 

A tip for use in Atomic Force Microscopy comprising a protein structure according to 
any one of claims 1 to 14. 
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Peptides 

This invention relates to protein fibre formation and in particular to methods of producing 
protein fibres to form a protein structure comprising a plurality of first polypeptide units 
arranged in a first polypeptide strand and a plurality of second polypeptide units arranged in 
a second polypeptide strand, the strands preferably forming a coiled coil structure, and in 
which a first polypeptide unit in the first strand extends beyond a corresponding second 
polypeptide unit in the second strand in the direction of the strands. 
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Banner & Witcoff Ref. No. 00487.00012 
Client Ref. No. P 1 00087US-PCT-AGT-jmj 

JOINT DECLARATION FOR PATENT APPLICATION 

As the below named inventors, wc hereby declare that: 

Our residence, post office address and citizenship are as stated below next to our names; 

We believe we are the original, first and joint invemors of the subject matter which is claimed and for which a patent is 
sought on the invention entitled PROTEIN STRUCTURES AND PROTEIN FIBRES , the specification of which 
I I is attached hereto. i 
E>3 was filed on March 18. 2002 as Application Serial Number 10/088, 417 and w^s amended on 

(if applicable). j 

£g] was filed under the Patent Cooperation Treaty (PCT) and accorded International Appl Jcation 

No. PCT/GBQQ/03576 , filed September 18. 2000 . and amended on (if an; r). 

We hereby state that we have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. f 

We hereby acknowledge the duty to disclose information which is material to patentability in ; jeeordance with Title 37, 
Code of Federal Regulations, § 1.56(a). 



Prior Foreign Application(s) 

We hereby claim foreign priority benefits under Title 35, United States Code, §119 of any djreign application(s) for 
patent or inventor's certificate listed below and have also identified below any foreign application(s 
certificate having a filing date before that of the application on which priority is claimed: 



Country 


Application No. 


Date of Filing 
(day month year) 


Date of Issue 
(day month year) 


Priority Claimed 
Under 35 U.S.C. 
§119 


Great Britain 


9922013.9 


17 September 1999 




Yes 



Prior United States Provisional Application(s) 

We hereby claim priority benefits under Title 35, United States Code, § 1 19(e)(1) of any U.S j provisional application 
listed below: 



for patent or inventor's 



U.S. Provisional Application No. 



Date of Filing 
(day month year) 



Priorjjty 
Under 35 I 



Claimed 
S.C.§ 119(e)(1) 



Prior United States Application(s) 

We hereby claim the benefit under Title 35, United States Code, § 1 20 of any United States aj 
and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior U: 
the manner provided by the first paragraph of Title 35, United States Code, §112, wc acknowledge the 
information as defined in Title 37, Code of Federal Regulations, § 1.56(a) which occurred between Til 
application and the national or PCT international filing date of this application: 



lication(s) listed below 
ted States application in 
uty to disclose material 
filing date of the prior 



Application Serial No. 


Date of Filing 
(Day, Month, Year) 


Status 
Pending 


p- Patented, 
, Abandoned 
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Power of Attorney 



And we hereby appoint, both jointly and severally, as our attorneys with full power of subsl ration and revbe^ation, to 



prosecute this application and to transact all business in the Patent and Trademark Office connected 



Customer Number: 22907 (WDC) 



fliiiiil 



Please address all correspondence and telephone communications to the address and telephcftral 

Number. 



22907 



rs at: 



Btomer 



We hereby declare that all statements made herein of our own knowledge are true and that Jl statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that willful 
false statements and the like so made are punishable by fine or imprisonment, or both, under SectionPlOOl of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the application or any patent issuing 
thereon. 



Signature 

Full Name of First Inventor 

FirsfTHvenT^ame Sccorjl Giver^Nan 

Residence Great Britain Citizenship Great Britain 

Post Office Address Uru versi ty_of Sussex JFalmer, Brigh ton^^^jg^N 1 9C}C h Great Britain 



Signature. 

Full Name of Second Invent 



Residence Great Britain 

Post Office Address Vniffi - ft ity of Sussex, Falmer. Brighton. 



Signature 

Full Name of Third Inventor 




Residence Great Britain 

Post Office Address University of Sussex. F aimer, Brig hton, 



Signature 

Full Name of Third Inventor 

Residence Great Britain 
Post Office Address Elfordlea. Mill La ne 

es 



Banner & Witcoff, Ltd. 



Page 2 of 2 



:l o o h; 3 »+ :i :/ ,,. o e» e» cp e? 
WlWOTRec'd 22 JUL 2002 



SEQUENCE LISTING 

<110> Woolfson, Derek 
Washaw, John 
Pandya , Maya 
Colyer, John 

<120> PROTEIN STRUCTURES AND PROTEIN FIBRES 



<130> 000487.00012 

<140> 10/088,417 
<141> 2002-03-18 

<150> PCT/GB00/03576 
<151> 2000-09-18 

<150> GB9922013.9 
<151> 1999-09-17 

<160> 18 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 1 

Lys He Ala Ala Leu Lys Gin Lys He Ala Ser Leu Lys Gin Glu He 

X 5 10 15 

Asp Ala Leu Glu Tyr Glu Asn Asp Ala Leu Glu Gin 
20 25 

<210> 2 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 2 

Lys He Arg Ala Leu Lys Ala Lys Asn Ala His Leu Lys Gin Glu He 

1 5 ' 10 15 

Ala Ala Leu Glu Gin Glu He Ala Ala Leu Glu Gin 
20 25 

<210> 3 



- 1 - 



.1 o n a e Mki 7 u o y s?: h o a 



<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 3 

Lys He Ala Ala Leu Lys Gin Lys He Ala Ala Leu Lys Gin Glu He 

1 5 10 15 

Asp Ala Leu Glu Tyr Glu Asn Asp Ala Leu Glu Gin 

20 ■ 25 

<210> 4 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 4 

Lys He Arg Ala Leu Lys Trp Lys Asn Ala His Leu Lys Gin Glu He 

15 10 15 

Ala Ala Leu Glu Gin Glu He Ala Ala Leu Glu Gin 
20 25 

<210> 5 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 5 

Tyr Gly Pro Gly Glu He Ala Ala Leu Glu Gin Glu Asn Ala Ala Leu 

! J 5 10 15 

Glu Gin 



<210> 6 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 6 

Lys He Ala Ala He Lys Gin Lys He Ala Ala Leu Lys Gin Glu He 

! 5 10 15 

Asp Ala Leu Glu Tyr Glu Asn Asp Ala Leu Glu Gin 
20 25 
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<210> 7 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 



<400> 7 

Lys lie Ala Ala Leu Lys Gin lie Cys lie Ala Ala Leu Lys Gin Glu 

15 10 15 

lie Asp Ala Leu Glu Tyr Glu Asn Asp Ala Leu Glu Gin 
20 25 

J 

<210> 8 
<211> 28 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic peptides 
<400> 8 

Lys lie Ala Ala Leu Lys Gin Lys 

1 5 
Asp Ala Leu Glu Tyr Glu Asn Asp 
20 



lie Ala Ser Leu Lys Gin Glu lie 

10 15 
Ala Leu Glu Gin 
25 



<210> 9 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 



<400> 9 

Lys lie Ser Ala Leu Lys Trp Lys 

1 ■ 5 
Ala Ala Leu Glu Gin Glu lie Ala 
20 



Asn Ala Ser Leu Lys Gin Glu lie 

10 15 
Ala Leu Glu Gin 
25 



<210> 10 
<211> 28 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 10 

Lys lie Arg Ala Leu Lys Trp Lys Asn Ala His Leu Lys Gin Glu lie 

15 10 15 

Ala Ala Leu Glu Gin Glu lie Ala Ala Leu Glu Gin 
20 25 
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<210> 11 

<211> 29 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> synthetic peptides 
<400> 11 

lie Cys lie Arg Ala Leu Lys Ala 

1 5 
He Ala Ala Leu Glu Gin Glu He 
20 



Lys Asn Ala His Leu Lys Gin Glu 

10 15 
Ala Ala Leu Glu Gin 
25 



<210> 12 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 



<400> 12 

He Thr He Arg Ala Leu Lys Cys 

1 5 
He Ala Ala Leu Glu Gin Glu He 
20 



Lys Asn Ala His Leu Lys Gin Glu 

10 15 
Ala Ala Leu Glu Gin 
25 



<210> 13 

<211> 28 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> synthetic peptides 
<400> 13 

Glu He Asp Ala Leu Glu Tyr Glu 

1 5 
Ala Ala Leu Lys Gin Lys He Ala 
20 



Asn Asp Ala Leu Glu Gin Lys He 

10 15 
Ser Leu Lys Gin 
25 



<210> 14 
<211> 7 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic peptides 
<400> 14 

Leu Ala Ala Leu Ala Ala Ala 
1 5 
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<210> 15 
<211> 43 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 15 

Lys lie Ala Ala Leu Lys Gin Lys lie Ala Ser Leu Lys Gin Glu lie 

15 10 15 

Asp Ala Leu Glu Tyr Glu His His Asp Ala Leu Glu Gin Lys lie Ala 

20 25 30 

Ala Leu Lys Gin Lys lie Ala Ser Leu Lys Gin 
35 40 

<210> 16 
<211> 42 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 16 

Glu lie Ala Ala Leu Glu Gin Glu lie Ala Ala Leu Glu Gin Lys lie 

15 10 15 

Arg Ala Leu Lys Ala Lys Gin Ala Lys Leu Lys Gin Glu He Ala Ala 

20 25 30 

Leu Glu Gin Glu He Ala Ala Leu Glu Gin 
35 40 

<210> 17 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 17 

Glu He Asp Ala Leu Glu Tyr Glu Gin Asn Asp Ala Leu Glu Gin Lys 

15 10 15 

He Ala Ala Leu Lys Gin Lys He Ala Ser Leu Lys Gin 
20 25 

<210> 18 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic peptides 
<400> 18 
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Lys He Arg Ala Leu Lys Ala Lys Phe Asn Ala His Leu Lys Gin Glu 

15 10 15 

He Ala Ala Leu Glu Gin Glu He Ala Ala Leu Glu Gin 

20 25 
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□ Page(s) 
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of 
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of 
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